Method for transceiving media files and device for transmitting/receiving using same

ABSTRACT

A method and apparatus for transmitting and receiving an MPEG-2 TS-based media file are provided. In the method of transmitting an MPEG2 Transport Stream (TS)-based media file, a Presentation Time Stamp (PTS) of a Presentation Unit (PU) is mapped to a Media Presentation Time (MPT) for synchronization with other media with respect to at least one program element forming a program included in the media file; and synchronization information including the mapped media presentation time is transmitted together with the media file, wherein the media presentation time is provided for a first PU in a segment of the transmitted media file.

CROSS-REFERENCE TO RELATED APPLICATIONS

The present application claims priority under 35 U.S.C. 119 and 35U.S.C. 365 to U.S. provisional Application No. 61/365,746 (filed on Jul.19, 2010), U.S. provisional Application No. 61/453,948 (filed on Mar.17, 2011), and PCT Application No. PCT/KR2011/005307 (filed on Jul. 19,2011) which are hereby incorporated by reference in its entirety.

BACKGROUND

The present disclosure relates to a method and apparatus fortransmitting and receiving an MPEG-2 Transport Stream (TS)-based mediafile.

Nowadays, digital broadcasting such as terrestrial, cable or satellitebroadcasting, or Digital Multimedia Broadcasting (DMB) mostly streams AV(Audio/Video) contents using MPEG-2 TS.

In addition, a multimedia service, which provides contents over anInternet Protocol (IP) network as a main transport network, is activatedaccording to a rapid growth of the internet. The digital broadcasting isdeveloping towards the direction of requiring more traffic likestereoscopic 3-dimensional (3D) video broadcasting, Ultra HighDefinition (UHD) broadcasting, 3D multi-view video broadcasting orhologram broadcasting.

However it may be less efficient to transport content of higherresolution than the prior art HDTV using MPEG-2 TS having a sequence offixed length packets of 188 bytes over the IP network.

SUMMARY

Embodiments provide a method and apparatus for transmitting andreceiving an MPEG-2 TS-based media file for playing the same back insynchronization with another media.

In one embodiment, a method of transmitting an MPEG2 TS-based mediafile, the method includes: mapping a Presentation Time Stamp (PTS) of aPresentation Unit (PU) to a Media Presentation Time (MPT) forsynchronization with other media with respect to at least one programelement forming a program included in the media file; and transmittingsynchronization information including the mapped media presentation timetogether with the media file, wherein the media presentation time isprovided for a first PU in a segment of the transmitted media file.

In another embodiment, a method of receiving an MPEG2 TS-based mediafile, the method includes: receiving the media file with synchronizationinformation; extracting an MPT of a PU for at least one program elementforming a program included in the media file; and playing back the mediafile using the extracted media presentation time, wherein the MPT ismapped from a PTS of a first PU in a segment of the received media file.

In further another embodiment, an apparatus for transmitting an MPEG2TS-based media file, the apparatus includes: a controller mapping a PTSof a first PU among PUs included in each segment or a PTS of a first PUafter a PTS discontinuity occurs to an MPT for synchronization withother media for a program element forming a program included in themedia file; and a network interface unit transmitting synchronizationinformation including the mapped MPT together with the media file.

In still further another embodiment, an apparatus for receiving an MPEG2TS-based media file, the apparatus includes: a network interface unitreceiving the media file with synchronization information; a decodingunit decoding the received media file; and a synchronization unitsynchronizing the decoded media file with other media using an MPTincluded in the synchronization information, wherein the MPT is mappedfrom a PTS of a first PU in a segment of the received media file or aPTS of a first PU after a PTS discontinuity occurs.

According to embodiments, since MPEG-2 TS-based media file istransported with synchronization information including mediapresentation time, it is possible to randomly access to the media fileand to easily synchronize the media file to other media installed insidea receiving apparatus or attached externally and play the media file.

In addition, since media presentation time for a time point whendiscontinuity occurs at a Presentation Time Stamp (PTS) is provided to areceiving apparatus side, the media file can be played back insynchronization with the media presentation time even in a case wherethe PTS discontinuity occurs.

The details of one or more embodiments are set forth in the accompanyingdrawings and the description below. Other features will be apparent fromthe description and drawings, and from the claims.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a view illustrating a configuration of an ISO Base Media fileformat.

FIG. 2 is a view illustrating a structure of a MPEG2 PacketizedElementary Stream (PES) packet.

FIG. 3 is a view illustrating a structure of MPEG TS.

FIG. 4 is a graph illustrating a Presentation Time Stamp (PTS) of MPEG2Transport Stream (TS) vs. a Media Presentation Time (MPT).

FIG. 5 is a view illustrating an embodiment of a method of presentingany Presentation Unit (PU) included in MPEG2 TS.

FIG. 6 is a block diagram illustrating a schematic configuration of anapparatus for transmitting a media file according to an embodiment.

FIG. 7 is a flowchart illustrating a method of transmitting a media fileaccording to an embodiment.

FIG. 8 is a view illustrating an embodiment of a method of presentingany PU included in MPEG2 TS in synchronization with a media playtimeline.

FIGS. 9 and 10 are views illustrating configurations of a Segment IndexBox (SIB) including synchronization information.

FIGS. 11 and 12 are views illustrating configurations of an eXtensibleMarkup Language (XML) file including synchronization information.

FIG. 13 is view of illustrating an embodiment of a media file structuretransmitted from a transmitting apparatus according to an embodiment.

FIG. 14 is a block diagram illustrating a configuration of an apparatusfor receiving a media file according to an embodiment.

FIG. 15 is a flow chart illustrating a method of receiving a media fileaccording to an embodiment.

FIG. 16 is a view illustrating an embodiment of a segment list createdby using a Media Presentation Description (MPD).

FIG. 17 is a view illustrating an embodiment of a method of playing anMPEG2 TS-based media file in synchronization with a 3GPP media file.

FIGS. 18 and 19 are views illustrating embodiments of a method ofimplementing a media file using an MPD of MPEG Dynamic AdaptiveStreaming over HTTP (DASH).

FIG. 20 is a block diagram illustrating a configuration of an IPTVreceiver according to a first embodiment of the present invention.

FIG. 21 is a block diagram illustrating a configuration of an IPTVreceiver according to a second embodiment of the present invention.

DETAILED DESCRIPTION OF THE EMBODIMENTS

Reference will now be made in detail to the embodiments of the presentdisclosure, examples of which are illustrated in the accompanyingdrawings.

A method of and apparatus for transmitting and receiving MPEG-2Transport Stream (TS)-based media file according to an embodiment willbe described in detail with reference to the accompanying drawings. Theinvention may, however, be embodied in many different forms and shouldnot be construed as being limited to the embodiments set forth herein;rather, that alternate embodiments included in other retrogressiveinventions or falling within the spirit and scope of the presentdisclosure can easily be derived through adding, altering, and changing,and will fully convey the concept of the invention to those skilled inthe art.

FIG. 1 illustrates a configuration of an ISO Base Media file format. Inorder to stream the illustrated ISO Base Media file format, a“fragmented file” format may be adopted.

Referring to FIG. 1, the fragmented file may be formed by dividing mediatracks into plural fragments, each of which includes a movie fragment(moof) box and a media data (mdat) box.

The mdat box after the moof box includes fragment media data and thiscombined moof-mdat pair may form a single fragment.

In the fragmented ISO Base Media file format, general timing fields maybe provided a Sample Table (ST) box such as a decoding time-to-sample(stts) box or a composition time-to-sample (ctts) box. However since theabove described timing fields exist in moov (Movie) box, they are rarelyused in transporting timing information.

Meanwhile since the moov box is finely encoded, it is assumed all theprevious samples are valid to a client side, for example a receivingapparatus, in order to access to a desired sample. The above describedassumption is not proper to a streaming application.

A Movie Fragment Random Access (mfra) file positioned at the end of amedia file may be used to provide a reference to fragments includingrandom access points.

However the mfra box uses an absolute byte offset from the beginning ofa file referring to the fragments and is not fragmented. Therefore itcan provide timing information of a fragmented media file in a storedstate like a recording process but is not possibly used in streamingapplication.

Accordingly, the ISO Base Media file format as shown in FIG. 1 does notprovide timing information for streaming the fragmented tracks.

On the other hand, a segment is a unit that a media file is transmittedand received and a configuration thereof may be variously defined.

For example, in “3GPP Adaptive HTTP Streaming (AHS)”, “OIPF HTTPAdaptive Streaming (HAS)”, and “HTTP Streaming of MPEG media”, thesegment includes one or more movie fragments and each segment hasapproximately identical duration in one period. In “3GPP AHS”, theentire structure of the segment is defined and it is allowed to use thesegment as a fine download unit or to download a portion of the segmentusing a byte range.

A 3GPP file format may include a Segment Index (sidx) box which providesa track play start time for the tracks in the fragment in order toprovide timing information on lost tracks and the play times may berecovered from decoding times.

A “3GPP AHS” client may fetch a segment using an approximately createdtimeline and reconstruct a timing of each track using the sidx box. Oncethe entire segment or a portion of the segment is downloaded, the timinginformation may be included in the file in order to be used for playingafterward.

In “Microsoft Smooth Streaming (MS SS)”, the segment may be defined asone fragment, and the one fragment may include one track. The fragmentis a basic unit for downloading and the term ‘segment’ is not used.

The “MS SS” may provide a FragmentTime attribute for each fragmentwithin a ManifestResponse file and the information may be provided in aStreamFragmentElement field.

In the foregoing “MS SS”, since only one track exists in the trackfragment, a timing of the track fragment can be sufficiently representedby a single FragmentTime. The timing information may be requested foreach track in order to enable a start of a synchronized presentationafter a random access and a switch between replaceable presentations.

Storing of the “MS SS” media file is performed in a file in which thetiming information is stored, which is performed by storing the mediafile in the MS PIFF or including the timing information in a mfra box.

FIG. 2 illustrates an MPEG2 PES packet structure and FIG. 3 illustratesa MPEG2 TS structure.

Referring to FIG. 2, an MPEG2 video stream compressed by an MPEG encodermay be packetized into a PES packet having a predetermined size.

The PES packet includes a PES header and a PES packet data field and thePES header may include an optional PES HEADER. In addition, the optionalPES HEADER includes an optional field 510 and the optional field 510includes a PTS field and a DTS field.

The PTS field may include Presentation Time Stamp (PTS) information andthe DTS field may have Decoding Time Stamp (DTS) information.

Synchronization between a video stream and an audio stream may beachieved with Time Stamp, which is time information representing when topresent each decoding unit called an access unit of the video and audiostreams.

For example, a PTS in the audio stream and a PTS and a DTS in videostream may be provided as the time information. The video and audiostreams may be synchronized with each other by comparing the foregoingtime stamp and a System Time Clock (STC) which is a time reference of adecoder and determining whether to present them.

Referring to FIG. 3, a transport packet of MPEG2 TS includes 4 byteheader and a following 184 byte payload and has total 188 byte sizeregardless of an intended transport mechanism.

The payload may include a PES packet, a Program Association Table (PAT),a Program Map Table (PMT) or the like, and the header may include aplurality of fields such as a sync byte and an adaptation field.

The adaptation field may include an optional field and the optionalfield may include a Program Clock Reference (PCR) field to which PCRinformation of time reference information is written.

The PCR is inserted to a header of MPEG2 TS for synchronization betweenan encoder and a decoder as a reference clock for PTS and STS.

The PTS is used for synchronized-presentation of element streamsincluded in one program, but does not have any relationship ofsynchronization with an external clock such as the Universal TimeCoordinated (UTC).

In addition, continuity of the PTS is not secured, so discontinuity orreset may occur in the PTS.

For example, in an application field of MPEG2 TS such as digital videobroadcasting, the foregoing discontinuity of the PTS may be delivered bythe PCR and a discontinuity indicator as shown in FIG. 3 immediatelybefore the corresponding event occurs.

The PTS may be used for synchronized presentation of one program, but isnot sufficient to be used in presentation in synchronization with ageneral presentation timeline, for example, a media presentationtimeline used in a HTTP streaming service.

In order to use the PTS in synchronization with the general presentationtimeline, initial synchronization, synchronization after a random accessand synchronization after the PTS discontinuity are needed to beconsidered.

FIG. 4 illustrates a graph of relationship between PTS of MPEG2 TS andMedia Presentation Time (MPT).

In FIG. 4, a solid line indicates a media presentation timeline withwhich the presentations in all the media are synchronized, and a dashedline indicates a PTS timeline representing a local presentation timewithin a program.

The MPT indicates times within the corresponding file as defined in theISO Base Media file format and starts from ‘0’.

The PTS may be represented as a time-scaled PTS timeline that a valuemeasured in 90 KHz clock is represented as a 33 bit size.

Referring to FIG. 4, the media presentation timeline may start frompresentation start offset provided in a manifest file such as MediaPresentation Description (MPD).

The PTS timeline may start from any point and have discontinuity d insome cases.

Other than ISO Base Media, MPEG TS allows direct fragmentation within arange where the fragment has a 188 byte size. MPEG2 TS-based “Apple HTTPStreaming” or “OIPF HAS” uses the same scheme as the one that the MPEG2TS is segmented into segments in a predetermined period.

In this case, the segment period is not generally precise for each mediacomponent. The media component of a program included in the MPEG2 TSdivides the MPEG2 TS into segments in a rough period using a PCR or aPTS as a reference, so the segment periods may vary among tracks.

“Apple HTTP Streaming” and “OIPF HAS” do not designate or refer toadditional ways representing timing information for MPEG2 TS, downloadsMPEG TS using a rough timeline and sets a start time of the segment tothe rough timeline.

Since “MS SS” is not related to delivery of MEG2 TS media files, it maybe difficult to directly apply “MS SS” to the transporting of the MPEG2TS. In addition, “MS SS” allows only one track for each fragment, so “MSSS” is not suitable to a transport stream having a plurality ofmultiplexed programs.

After downloading a portion of or the entire file, the correspondingfile is required to be stored as an MPEG TS file and the timinginformation is require to be valid for future playback. Accordinglyperfect synchronization with the media presentation timeline in theMPEG2 TS file transport using “MS SS” may be hard to achieve.

FIG. 5 illustrates an embodiment of a method of presenting anypresentation unit (PU) included in MPEG2 TS.

Referring to FIG. 5, MPEG2 TS packets respectively have a fixed 188 bytesize and may be transported in a segment unit including one or morepackets, for example, 3 MPEG2 TS packets.

Meanwhile, a client side, for example, a receiving apparatus may try toreceive any segment, synchronize any presentation unit PU_(y) with themedia presentation line and present PU_(y).

At this time, a PTS discontinuity may occur in a previous segment of asegment including the presentation unit PU_(y) to be presented and thusthe corresponding segment may not be received to be valid due to the PTSdiscontinuity in a random access case.

That is, in a MPEG2 TS-based media file, precise media presentation timeinformation for a PU is not provided for a random access or a continuousaccess in a segment. This may cause more limitation in the PTSdiscontinuity case.

When the MPEG2 TS is not synchronized with the media presentation time,internally stored or streamed MPECG2 TS-based media file and othermedia, for example, ISO Bas Media file format media or another MPEGTS-based media are not synchronized with each other and the media filemay not be played back.

According to an embodiment, MPEG2 TS media file may be synchronized withother media by signaling to map a timeline of MPEG2 TS-based media fileto the media presentation time.

For example, a transmitting apparatus according to an embodiment may mapa PTS of a PU for at least one program element forming a programincluded in the media file to the media presentation time andsynchronization information (abbreviated as sync information) includingthe mapped media presentation time may be transported with the mappedmedia presentation time.

FIG. 6 illustrates a block diagram of a schematic configuration of amedia file transmitting apparatus according to an embodiment. Theillustrated transmitting apparatus may include a controller 110 and anetwork interface unit 120.

Referring to FIG. 6, the controller 110 may generate sync information tobe provided to a client side, for example, a receiving apparatus, bymapping a PTS of a PU for a program element forming a program includedin an MPEG2 TS-based media file to a media presentation time.

The network interface unit 120 may transmits sync information includingthe mapped media presentation time to the receiving apparatus with theMPEG2 TS-based media file.

According to an embodiment, the sync information transmitted to theclient side may be provided to a first PU of each segment included inany one program element.

In this case, the sync information may be used to recover a presentationtime for all the PUs from the first PU of a specific segment to a nextPTS discontinuity occurring point included in MPEG2 TS.

In addition, the sync information may be provided for a first PU afterthe PTS discontinuity occur in the segment, and the sync information maybe used to recover the presentation time for all the PUs from the PTSdiscontinuity occurring point to a next PTS discontinuity occurringpoint of a next PTS discontinuity included in an MPEG2 TS.

Meanwhile, the PTS discontinuity does not occur any more, the syncinformation occurring point may be used to recover a presentation timefor all the PUs from the PTS discontinuity occurring point to an end ofthe corresponding segment included in the MPEG2 TS.

The MPEG2 TS-based playback may include a plurality of program elementsbelonging to one program and the sync information for any one of theprogram elements may be transported to the client side.

That is, when any one program element is synchronized with the mediapresentation time, the rest of the program elements may be synchronizedwith one program using a synchronization mechanism provided by the MPEG2TS.

Hereinafter a method of transmitting a media file according toembodiments will be described with reference to FIGS. 7 to 13.

FIG. 7 is a flowchart illustrating a method of transmitting a media fileaccording to an embodiment. The illustrated method of transmitting amedia file will be described in connection with the apparatus fortransmitting a media file according to the embodiment illustrated inFIG. 6.

Referring to FIG. 7, the controller 100 of the transmitting apparatus100 selects one of program elements included in a media file to betransmitted (operation S200).

For example, the controller 100 may select video tracks among videotracks and audio tracks forming the program as a reference programelement.

Then the controller 110 maps a PTS of a PU for the selected programelement to a media presentation time (operation S210).

Referring to FIG. 8, media presentation times, for example, MPT₁ toMPT_(n) for a first PU of a segment among PUs belonging to a specificprogram element may be transported to the client side as syncinformation.

Meanwhile, when a specific presentation unit PU_(y) is to be presented,a media presentation time MPT0 for PUy may be calculated by mapping aPTS of the PU as the following equation 1

MPT_(o)=((PTS₁−PTS_(k))/timescale_(PTS))*timescale+MPT_(n)  (1)

In equation 1, PTS₁ denotes a PTS of PU_(y), PTS_(k) denotes a PTS of apresentation unit PU_(N) positioned before the PU_(y), and MPT_(n)denotes a media presentation time for the PU_(x).

timescale_(PTS) denotes a clock of the PTS, for example 90 kHz, andtimescale denotes a clock of the MPT.

For example, the controller 110 of the transmitting apparatus 100recovers PTS_(s), namely, PTS₁ and PTS_(k) for PU_(y) and PU_(x)respectively, and calculates a next media presentation time MPT₀ bylinear interpolation like equation 1 using the recovered PTS₁ andPTS_(k).

As shown in FIG. 8, after the occurrence of the PTS discontinuity in thesegment, a media presentation time, for example, MPT_(m) for the firstPU may transported to the client side as sync information.

Accordingly, even in a case where the discontinuity occurs in a specificsegment, the media presentation time for the specific presentation unitmay be calculated by the foregoing linear interpolation with referenceto the equation 1.

As the above, a method of mapping the PTS of the presentation unit to amedia presentation time is described for an exemplary case where PU_(x)and PU_(y) are included in one program element with reference to FIG. 8.However in a case where the PU_(x) and PU_(y) may be respectivelyincluded to different program elements, a media presentation time for aspecific presentation unit may be calculated using the foregoing method.

The network interface unit 120 transports sync information including thecalculated media presentation times to the client side with the mediafile (operation S220).

According to the foregoing mapping scheme, a specific presentation unitmay be searched for on the basis of the PTS for a given mediapresentation time, which may be used in random access or streamswitching.

For example, PTS1 which is a PTS value for any media presentation timeMPT0 may be calculated as the following equation 2.

PTS₁=(MPT_(o)−MPT_(n)/timescale)*timescale_(PTS)+PTS_(k)  (2)

When the media presentation time is represented by a second unit, not bya clock, the equation 2 may be simplified as the following equation 3

PTS₁=(MPT_(o)−MPT_(n))*timescale_(PTS)+PTS_(k)  (3)

According to an embodiment of the present invention, the transmittingapparatus 100 transmits media presentation times (MP₁, MP_(m) andMPT_(n)) for the first PU or a first PU after the occurrence of the PTSdiscontinuity among the PUs included in the segment as shown in FIG. 8,or obtains a media presentation time MPT₀ for another PU according tothe foregoing mapping method and transports the obtained MPT to theclient side as the sync information.

Hereinafter a configuration of sync information transported to theclient side will be described in detail with reference to FIGS. 9 to 12.

The sync information may be structuralized as the following way.

At first, any one of a plurality of the program elements included in aprogram of an MPEG2 TS media file is selected as a reference programelement and a media presentation time for a first PU of the selectedreference program is provided to the client side.

In order to handle the PTS discontinuity included in a segment, aduration from the first PU of the selected reference program to a timepoint of a next PTS discontinuity occurrence (or to an end of thecorresponding segment when the PTS discontinuity does not occur) isprovided to the client.

When the PTS discontinuity occurs twice or more, the duration from anyone PTS discontinuity to a next PTS discontinuity (or to an end of thecorresponding segment when the PTS discontinuity does not occur) isprovided to the client side.

In addition, a PTS of a PU of the reference program element immediatelyafter the PTS discontinuity occurrence may be also provided to theclient side.

A media presentation time for all the PUs included in the segment may berecovered by using the sync information created as the forgoing.

However, in addition to signaled information for synchronizedpresentation as the foregoing, a duration before and after the PTSdiscontinuity may be more divided into smaller durations and thisadditional signaling may be used by the client side for a random accessor the like.

As described above, although a signaling of a presentation time for anyone program element in a program may be necessary and sufficient tosynchronize any program element with a media presentation timeline,syntax of the sync information may enable signaling of presentationtimes for a plurality of program elements.

This may be advantageous for the client to try to selectively presentprogram elements.

For example, when an MPEG2 TS program includes two audio tracks L1 andL2 of different languages, the client may want to playback only L1 trackbetween the audio tracks.

At this time, if signaling for the media presentation time is providedonly to L2, the client needs to extract PTS information from L1 torecover presentation times. However signaling for media presentationtime is provided to both L1 and L2, the client does not need to extractseparate PTS information to recover the presentation times.

In order to segment a transmitted MPEG2 TS media file, the media file isroughly segmented according to a reference track period as required froma transport protocol and boundaries of the segments may be adjusted toMPEG2 TS packets.

According to an embodiment, the foregoing sync information may berepresented as a binary coded format file or an XML file.

FIGS. 9 and 10 illustrate embodiments of a configuration of a segmentindex box including sync information, which is an example representingthe sync information in a binary format.

According to an embodiment, the sync information as described inrelation to FIGS. 6 to 8 may be represented in a binary format on thebasis of general ISO Base Media file format data types and has a similarstructure to a sidx box used in 3GPP AHS.

Referring to FIG. 9, the sync information may be formed of an MPEG2segment index box m2si, and the m2si box may be included one or more ina specific segment.

The m2si box may provide media presentation time for PUs included inMPEG2 TS with respect to one or more program elements of MPEG2 TS.

The segment may be subdivided into a plurality of MPEG2 TS packet groups(MTPGs) and the MTPGs may be formed of a set of adjacent MPEG2 TSpackets.

When one program element, for example, a video program element isselected as a reference program element, at least a first sample,namely, a presentation time for a PU of the reference program elementmay be provided in a first loop.

A second loop may be performed on each MTPG included in a segmentstarting from the first MTPG. In the second loop, a presentationduration of a reference program element of each MTPG may be providedwith byte offsets from a first byte of an MPEG2 TS to a correspondingMTPG in the segment.

The presentation duration of the MTPG may be a sum of presentationdurations of all the PUs of the reference program elements belong to theMTPG.

In the second loop, information on whether a random access point (RAP)is included in a reference program element of an MTPG may be provided.When the RAP is included, a presentation time offset from a presentationtime for a first sample in the corresponding segment may be moreprovided.

Other than sidx box of 3GPP AHS, the m2si box may be positioned at astart portion of the segment and does not allow a hierarchicalstructure, so that the m2si box may be easily removed.

Referring to a syntax structure as shown in FIG. 10, flags may form a24-bit integer flag and defined as follows.

timescale-present represents whether time scale of a media presentationtime exists in the corresponding box and a flag value thereof may be‘0x000001’.

pts-present represents whether a PTS of a first PU of a reference trackis included for all the written MTPGs and a flag value thereof may be‘0x000002’.

timescale is a time scale of media presentation timeline. When thetimescale is provided one or more in one media presentation, a valuethereof may be identical for all the presentations.

reference-PID provides packet identifiers (PIDs) of MPEG2 TS packetstransporting a reference program element.

program_element_count represents the number of program elements indexedin a next loop and may be designated 1 or a greater value.

referenc_count represents the number of program elements indexed in asecond loop and may have 1 or a greater value.

PID represents PIDs of MPEG2 TS packets transporting program elementstherein to which representation_time is provided and a PID in thecorresponding loop is identical to the reference_PID.

presentation_time may represent a presentation time for a first samplein a program element identified by the PID expressed in the timescale.

Reference_offset may represent a byte distance from a first byte of anMPEG2 TS segment to a first byte of an MPTG referred by repeating of thecorresponding loop.

MTPG_duration provides a presentation period of a reference programelement for the referred MPTG and may be represented in a time scale ofa corresponding track.

PTS may represent an MPEG2 TS PTS of a first PU of a reference programelement of the corresponding MPTG.

contains_RAP may be set to ‘1’ in order to represent that the referenceprogram element in the referred MPTG includes a RAP.

discontinuity may be set to ‘1’ in order to represent that thecorresponding MPTG starts from discontinuity.

RAP_delta_time may provide a presentation time offset of a RAP in orderto correspond to a presentation time for a first sample of the referenceprogram element in the segment, when contains_RAP is set to ‘1’. Whencontains_RAP is set to ‘0’, the RAP_delta_time may be maintained as ‘0’.

FIGS. 11 and 12 illustrate embodiments of a configuration of an XML fileincluding sync information.

Referring to FIG. 11, an XML file may include a media presentation timefor PUs of one or more program elements of an MPEG2 TS and the segmentmay be subdivided into adjacent MTPGs.

When one program element, for example, a video program element isselected as a reference program element, a presentation time for atleast a first sample, namely, a presentation unit of the referenceprogram element is provided in a first element.

In addition, a second element is for each MTPG included in a segmentstarting from a first MTPG. The second element may provide presentationperiods of a reference program element for each MTPG of the segment.

The presentation period of the MTPG may be a sum of presentation periodsof all the PUs of a reference program element belonging to thecorresponding group.

The second element may be provided information on whether the referenceprogram element of the MTPG includes a RAP. When the RAP is included, apresentation time offset may be further provided to a presentation timefor a first sample in the corresponding segment.

FIG. 11 illustrates a schema of sync information having a configurationas shown in FIG. 10 and the detailed description for the illustratedschema is the same as described above in relation to FIGS. 9 and 10.Therefore description thereof will be omitted.

FIG. 13 illustrates an embodiment of a file structure of a media filetransmitted from a transmitting apparatus according to an embodiment.

Referring to FIG. 13, an m2si box described in relation to FIGS. 9 and10 may be attached before an MPEG2 TS divided into a segment unit. TheMPEG2 TS data may be selectively encapsulated in a mdat box to make anaccess and demarcation easy.

As illustrated in FIG. 13, when the sync information is appended in abinary format, the m2si box may be attached before the mdat box.

The media file format may start from a segment type (styp) box, the stypbox may have a boxtype set as ‘styp’, a brand of ‘etsf’ and aminor_version set to ‘1’. Except this, the styp box may have the samesyntax structure as a file type box (ftyp) defined in the ISO/IEC14496-12 ISO Base Media file format.

Hereinafter embodiments of a media file receiving method and apparatuswill be described with reference to FIGS. 14 and 15.

A receiving apparatus 300 according to an embodiment may receive a mediafile with sync information, extract a media presentation time of a PUfrom the received sync information and then playback the media fileusing the extracted media presentation time.

A configuration of the sync information received by the receivingapparatus 300 may be the same as described above in relation to FIGS. 6to 13. Therefore detailed description thereof will be omitted.

FIG. 14 illustrates a block diagram of a configuration of a media filereceiving apparatus according to an embodiment. The illustratedreceiving apparatus 300 may include a network interface unit 310, adecoding unit 320 and a synchronization unit 330.

Referring to FIG. 14, the network interface unit 310 of the receivingapparatus 300, for example, a HTTP interface, receives a media file withthe sync information and the decoding unit 320 may decode the receivedmedia file and output the decoded media file.

The synchronization unit 330 may synchronize the decoded media file witha media presentation timeline using a media presentation time includedin the sync information.

For example, the network interface unit 310 may receive sync informationhaving the configuration as described above in relation to FIGS. 6 to 13with an MPEG2 TS from the transmitting unit 100.

The decoding unit 320 may include a second decoder 322 for decoding theMPEG2 TS and the second decoder 322 may be a general MPEG2 TS decoder.

The second decoder 322 may receive to decode the MPEG2 TS from thenetwork interface unit 310 and output the decoded PUs and PTSs thereof.

The synchronization unit 330 may identify a presentation unit to which amedia presentation time is provided through the received syncinformation. For this, the network interface unit 310 recovers the PTSsof the PUs of which presentation times are provided and transfers therecovered PTSs to the synchronization unit 330 or receives thepresentation time and PTSs of the corresponding unit from thetransmitting apparatus 100.

For recovering the PTSs, elementary stream time recovery may be used.

The decoding unit 320 may further include a first decoder 321 to decodeanother media file received through the network interface unit 310, thefirst decoder 321 may be a decoder for recovering another format mediafile such as in a 3GPP file format or an MPEG2 TS decoder which is thesame as the second decoder 322.

The synchronization unit 330 may synchronize an MPEG2 TS-based mediafile output from the second decoder 322 with another media file outputfrom the first decoder 321 using decoding information transmitted fromthe transmitting apparatus 100, for example, media presentation timesfor the PUs.

The receiving apparatus 300 may playback the MPEG2 TS-based media filesynchronized by the synchronization unit 330 and another media filetogether.

According to another embodiment, the decoded MPEG2 TS may be stored in amemory (not shown) prepared in the receiving apparatus 300. The syncinformation included in the received m2si box or xml file may be alsostored in the receiving apparatus 300.

The stored sync information may be used later for synchronizedpresentation with other media components.

In addition, the byte offsets for the PTS discontinuities may be storedas attributes in the m2si box or the xml file including the syncinformation.

In this case, since all the segments of MPEG2 TS are combined, the byteoffsets provided in the synchronization box or file should correspond toa start of the MPEG2 TS file.

FIG. 16 illustrates an embodiment of a segment list created by using anMPD.

The client performing HTTP streaming may access to the MPD and createthe segment list as shown in FIG. 16 from the MPD at a specificclient-local time “NOW”.

“NOW” may represent a current value of a clock in the reference clientwhen an MPD instance is formed from the MPD.

The client that is not synchronized with a HTTP streaming server butsynchronized with the UTC may experience problems of accessing tosegments due to validity. HTTP streaming clients should synchronizetheir own clocks with a precise universal time.

FIG. 17 illustrates an embodiment of a method of synchronizing an MPEG2TS-based media file with a 3GPP media file and playing back.

Referring to FIG. 17, a specific PU PU_(y) included in an MPEG2 TS mediafile may be synchronized with a sample corresponding to the 3GPP mediafile and played back using the sync information as described above inrelation to FIGS. 6 to 12.

In the embodiment as shown in FIG. 17, the m2si box including MPDinformation, 3GPP file format information for segment and syncinformation may be given as follows.

MPD Information:

availabilityStartTime=2010-05-30T 09:30:10Z

SegmentInfo duration=PT2S (Segment boundary shown is for segment #32)

3GPP FF Information for segment:

time_scale=48000

track_ID 1=(Video)

decode_time=2974186

composition_offset=1602

track_ID 2=(Audio)

decode_time=2976000

m2si (corresponding)

PID=101

presentation_time=2977120

In this case, the UTC used for rendering PU_(y) which is a PU playedback according to the mapping method as described above in relation toFIGS. 5 to 8 may be calculated as the following equation 4.

PU_(y)=(((PTS₁−PTS_(k))/time_scale_ext)+presentation_time/time_scale)+availabilityStartTime  (4)

In equation 4, the UTC “2010-05-30T 09:31:02.07Z” used for renderingPU_(y) may be obtained by substituting PTS₁ with PTS “43249500” ofPU_(y), PTS_(k) with PTS “43245300” of a PU to which media presentationtime “29977120” is provided, time_scale_ext with clock “90000” of PTS,time_scale with “48000” given in 3GPP file format information, andavailabilityStartTime with “2010-05-30T 09:30:10Z” given in the MDPinformation.

FIGS. 18 and 19 illustrate embodiments of a method of implementing amedia file using a MPD of MPEG Dynamic Adaptive Streaming over HTTP(DASH).

As shown in FIG. 18, an index segment which is sync information asdescribed above in relation to FIGS. 6 to 17 may exist separately inMPEG-DASH.

The media file may be implemented with syntax having the same media filestructure as the described above in relation to FIG. 13 and the indexdata may be stored after the MPEG2 TS media file. Therefore the numberof stored files may be greatly reduced.

In addition, the client may confirm how many bytes correspond to theindex data and how many tailings correspond to the media data in themedia file.

In this case, the client may recognize that a Universal Resource Locator(URL) for a media segment coincides with a URL of the index segmentcorresponding to the media segment. Accordingly the client may dividethe segments in combination with a request.

A general client may not use this optimization and may obtain the sameeffect through two ttp requests.

Another client which is not able to use the index data may neglect thesame and just fetch the media data.

A method of receiving and processing the MPEG2 TS-based media file asdescribed in relation to FIGS. 1 to 19 may be performed by an IPTVreceiver according to an embodiment.

For this, the IPTV receiver may receive a media file transmitted from atransmitting side, for example, a media server, and a format of themedia file may have a configuration as described in relation to FIGS. 6to 19.

The IPTV receiver may synchronize an MPEG2 TS with other media and playback using sync information received from the transmitting side.

The IPTV receiver may perform a method of receiving a media file asdescribed in relation to FIGS. 14 and 15.

Hereinafter, a configuration of the IPTV receiver according to anembodiment will be described in detail with reference to FIGS. 20 and21.

FIG. 20 illustrates a block diagram of an IPTV receiver configurationaccording to an embodiment.

An IPTV receiver according to an embodiment may include a separate tunerfor receiving terrestrial broadcasting, cable broadcasting or satellitebroadcasting. However, in this disclosure, the IPTV receiver having aconfiguration of receiving an IPTV service provided over an IP networkwill be mainly described for convenience.

An ITF indicates Open IPTV Terminal Function and may suggest a receiverformed of function modules necessary for supporting the IPTV service.

The IPTV receiver may include a network interface unit 502, a TCP/IPmanager 502, a service delivery manager 504, a PVR manager 505, ademultiplexer (Demux) 508, a data decoder 507, an audio/video decoder512, an display and OSD unit 515, an application manager 513 and 514, aservice information (SI) and Metadata database (DB) 511, a servicediscovery manager 509, a service control manager 503, a metadata manager510 and a content DB 506.

Referring to FIG. 20, the network interface unit 501 receives andtransmits packets from and to a network. Namely, the network interfaceunit 501 receives a service or media contents from a service providerover the network.

The TCP/IP manager 502 manages packet delivery from a source to adestination with respect to packets received by the IPTV receiver andpackets transmitted from the IPTV receiver. In addition, the TCP/IPmanager 502 classifies the received packets so as to correspond to aproper protocol, and outputs the classified packets to the servicediscovery manager 509, the service control manager 503 and the metadatamanager 510.

The service delivery manager 504 controls the received service data. Forexample, in a case of controlling real-time streaming, the servicedelivery manager 504 may use a RTP/RTCP.

When the real-time streaming data is transported using the RTP, theservice delivery manager 504 parses the received data packet accordingto the RTP and transports the parsed data packet to the Demux 508 orstores the same to the content DB 506 according to a control of theservice manager 514. In addition, the service delivery manager 504 feedsthe network reception information back to a server side providing theservice using the RTCP.

The Demux 508 demultiplexes the received packets into audio, video andProgram Specific Information data and transports the demultiplexed datarespectively to the audio/video decoder 512 and the data decoder 507.

The data decoder 507 decodes, for example, service information such asPSI. Namely, the data decoder 507 receives a PSI section, a program andService Information Protocol (PSIP) section or a DVB-SI sectiondemultiplexed by the Demux 508 and decodes the received sections.

The data decoder 507 decodes the received sections to make a databasefor service information and the database for service information isstored in the SI & Netadata DB 511.

The audio/video decoder 512 decodes video data and audio data receivedfrom the Demux 508. The video data and audio data decoded in theaudio/video decoder 512 are provided to a user through the display andOSD unit 515.

The application manager 513 and 514 manages the entire state of the IPTVreceiver, provides a user interface and manages other managers. Forthis, the application manager includes a user interface manager 513 anda service manager 514.

The user interface manager 513 provides a Graphic User Interface (GUI)for a user using an On Screen Display (OSD), and performs receiveroperations according to a key input from the user. For example, whenreceiving a key input for a channel selection from the user, the userinterface manager 513 transports the key input signal to the servicemanager 514.

The service manager 514 controls managers related to a service such asthe service delivery manager 504, the service discovery manager 509, theservice control manager 503 and the metadata manager 510.

The service manager 514 makes a channel map and selects a channel usingthe channel map according to the key input received from the userinterface manager 513. The service manager 514 receives channel serviceinformation for the data decoder 507 and sets audio and video PIDs ofthe selected channel to the Demux 508.

The service discovery manager 509 provides information necessary toselect a service provider which provides services. When receiving asignal for the channel selection from the service manager 514, theservice discovery manager 509 searches for a service using theinformation.

The service control manager 503 selects and controls the service. Forexample, the service control manager 503 uses the Internet GroupManagement Protocol (IGMP) or RTSP when the user selects a livebroadcasting service which has the same scheme as the prior art, orselects and controls the service using the RTSP when the user selects aservice of Video on Demand (VOD).

The RTP may provide a trick mode for the real time streaming. Inaddition, the service control manager 503 may initialize and managesessions using an IP Multimedia Subsystem (IMS) through an INC gateway.The above-described protocols are examples and may be replaced withother protocols.

The metadata manager 510 manages metadata associated with the servicesand stores the metadata in the SI and metadata DB 511.

The SI and metadata DB 511 and the content DB 506 may be respectivelyimplemented using a nonvolatile RAM (NVRAM) or a flash memory and mayalso be implemented in logically separated two regions on the samestorage area.

The PVR manager 505 is a module for recording and presenting livestreaming contents, and may collect metadata for the recorded contentand create additional information to be provided to the user such as athumbnail image or an index.

A function of a controller of the IPTV receiver according to anembodiment may be implemented as divided into a plurality of moduleslike the TCP/IP manager 502, the service delivery manager 504, a PVRmanager 505, the application manager 513 and 514, the service discoverymanager 509, the service control manager 503 and the metadata manager510 as shown in FIG. 20.

For example, the TCP/IP manager 502 may filter SD&S information usingthe foregoing target package information and control the networkinterface unit 501 to request only a payload or a segment correspondingto a specific package, for example, a package to which the IPTV receiveris subscribed, to a server and receive the same.

Alternatively, the TCP/IP manager 502 may filter SD&S informationreceived in a multicast scheme using the target package information andcause the decoder 507 to parse and process only a payload or a segmentcorresponding to a specific package.

FIG. 21 illustrates a functional block diagram of a configuration of theIPTV according to another embodiment. In FIG. 21, a solid line indicatesa data path and a dashed line indicates a control signal path.

A cable modem or a DSL modem, etc 601, which is an interface causing theITF to connect to an IP network in a physical level, demodulates asignal transmitted through a physical medium and recovers a digitalsignal.

An Ethernet NIC 602 recovers IP data from a signal received through thephysical interface. An IP network stack 607 processes each layeraccording to the IP protocol stack.

An XML parser 609 parses an XML document from among received IP data. Afile handler 608 processes data transported in a file type through FLUTEor the like from among the received IP data.

An SI handler 611 processes a portion corresponding to IPTV serviceinformation from among the receive file type data and stores theprocessed data in a storage 612. An EPG handler 610 processes a portioncorresponding to IPTV EPG information from among the receive file typedata and stores the processed data in the storage 612.

The storage 612 stores various data such as the SI data and the EPGinformation data and the like.

An SI decoder 613 reads the SI data from the storage 612, and then,analyzes and obtains channel map information. An EPG decoder 614analyzes EPG data stored in the storage 612 and recovers informationnecessary to form an EPG.

An ITF operation controller 615 is a main controller controlling an ITFoperation of a channel change or an EPG display.

A channel service manager 616 may perform an operation such as a channelchange according to a user input. An application manager 617 may performan application service such as an EPG display according to a user input.

An MPEG-2 demultiplexer 603 extracts MPEG2 TS data from a received IPdatagram and transports the extracted data to a corresponding moduleaccording to the PID.

An MPEG2 PSI/PSIP parser 604 extracts the PID of A/V data or PSI/PSIPdata including connection information on program elements, and parsesthe extracted data.

An A/V decoder 605 decodes input audio and video data and delivers thedecoded data to a display module 606. The display module 606 displaysthe decoded A/V data or application.

Hitherto an apparatus and method for transmitting and receiving mediadata according to embodiments has been described by exemplifying a casewhere a media file transmitted and received between the transmittingapparatus 100 and the receiving apparatus 300 includes MPEG2 TS data.However, the present disclosure is not limited thereto and may beapplied to various transport stream formats besides the MPEG2 TS.

The invention can also be embodied as computer readable codes on acomputer readable recording medium. The computer readable recordingmedium is any data storage device that can store data which can bethereafter read by a computer system. Examples of the computer readablerecording medium include read-only memory (ROM), random-access memory(RAM), CD-ROMs, magnetic tapes, floppy disks, and optical data storage.

The computer readable recording medium can also be distributed overnetwork coupled computer systems so that the computer readable code isstored and executed in a distributed fashion. Also, functional programs,codes, and code segments for accomplishing the present invention can beeasily construed by programmers skilled in the art to which the presentinvention pertains.

Although embodiments have been described with reference to a number ofillustrative embodiments thereof, it should be understood that numerousother modifications and embodiments can be devised by those skilled inthe art that will fall within the spirit and scope of the principles ofthis disclosure. More particularly, various variations and modificationsare possible in the component parts and/or arrangements of the subjectcombination arrangement within the scope of the disclosure, the drawingsand the appended claims. In addition to variations and modifications inthe component parts and/or arrangements, alternative uses will also beapparent to those skilled in the art.

1-27. (canceled)
 28. A method for transmitting a Moving Picture ExpertsGroup 2 Transport Stream (MPEG2 TS)-based media file by an apparatus fortransmitting the media file, the method comprising: mapping aPresentation Time Stamp (PTS) of a first Presentation Unit (PU) of MPEG2TS packets in each segment of the media file to a Media PresentationTime (MPT) of the first PU in a media presentation timeline forsynchronization of at least one program element forming a programincluded in the media file with other media included in other mediafile; and transmitting synchronization information including the mappedMPT of the first PU together with the segment including the MPEG2 TSpackets of the media file, wherein the synchronization information is asegment index box in International Organization for StandardizationBased Media File Format (ISOBMFF) for the MPEG2 TS packets, wherein thesynchronization information is positioned in front of the MPEG2 TSpackets in the segment of the media file, wherein the synchronizationinformation further includes timescale information representing a clockfrequency of the media presentation timeline and packet identifier (ID)information of the MPEG2 TS packets in the segment, wherein an MPT of asecond PU in the media file or the other media file is calculated byusing a PTS difference between the PTS of the first PU and a PTS of thesecond PU, a clock frequency of the PTS of which value is 90000 and theclock frequency of the media presentation timeline.
 29. The methodaccording to claim 28, wherein the mapping of the PTS is performed byusing a following equation,MPT_(o)=((PTS₁−PTS_(k))/timescale_(PTS))*timeseale+MPT_(n) where MPT₀denotes the MPT of the second PU, PTS₁ denotes the PTS of the second PU,PTS_(k) denotes the PTS of the first PU, timescale_(PTS) denotes theclock frequency of the PTS, timescale denotes the clock frequency of themedia presentation timeline, and MPT_(n) denotes an MPT for the firstPU.
 30. An apparatus for transmitting a Moving Picture Experts Group 2Transport Stream (MPEG2 TS)-based media file, the apparatus comprising:a controller to map a Presentation Time Stamp (PTS) of a firstPresentation Unit (PU) of MPEG2 TS packets in each segment of the mediafile to a Media Presentation Time (MPT) of the first PU in a mediapresentation timeline for synchronization of at least one programelement forming a program included in the media file with other mediaincluded in other media file; and a network interface to transmitsynchronization information including the mapped MPT of the first PUtogether with the segment including the MPEG2 TS packets of the mediafile, wherein the synchronization information is a segment index box inInternational Organization for Standardization Based Media File Format(ISOBMFF) for the MPEG2 TS packets, wherein the synchronizationinformation is positioned in front of the MPEG2 TS packets in thesegment of the media file, wherein the synchronization informationfurther includes timescale information representing a clock frequency ofthe media presentation timeline and packet identifier (ID) informationof the MPEG2 TS packets in the segment, wherein an MPT of a second PU inthe media file or the other media file is calculated by using a PTSdifference between the PTS of the first PU and a PTS of the second PU, aclock frequency of the PTS of which value is 90000 and the clockfrequency of the media presentation timeline.
 31. The apparatusaccording to claim 30, wherein the mapping of the PTS is performed byusing a following equation,MPT_(o)=((PTS₁−PTS_(k))/timescale_(PTS))*timescale+MPT_(n) where MPT₀denotes the MPT of the second PU, PTS₁ denotes the PTS of the second PU,PTS_(k) denotes the PTS of the first PU, timescale_(PTS) denotes theclock frequency of the PTS, timescale denotes the clock frequency of themedia presentation timeline, and MPT_(n) denotes an MPT for the firstPU.
 32. A method for receiving a Moving Picture Experts Group 2Transport Stream (MPEG2 TS)-based media file by an apparatus forreceiving the media file, the method comprising: receiving segmentsincluding the MPEG2 TS packets of the media file and synchronizationinformation including a Media Presentation Time (MPT) of a firstPresentation Unit (PU) of each of the segments, wherein the MPT is in amedia presentation timeline; and performing a synchronization of atleast one program element forming a program included in the media filewith other media included in other media file by using thesynchronization information, wherein the synchronization information isa segment index box in International Organization for StandardizationBased Media File Format (ISOBMFF) for the MPEG2 TS packets, wherein thesynchronization information further includes timescale informationrepresenting a clock frequency of the media presentation timeline andpacket identifier (ID) information of the MPEG2 TS packets in thesegment, and wherein an MPT of a second PU in the media file or theother media file is calculated by using a Presentation Time Stamp (PTS)difference between a PTS of the first PU and a PTS of the second PU, aclock frequency of the PTS of which value is 90000 and the clockfrequency of the media presentation timeline.
 33. The method accordingto claim 32, wherein the mapping of the PTS difference is performed byusing a following equation,MPT_(o)=((PTS₁−PTS_(k))/timescale_(PTS))*timescale+MPT_(n) where MPT₀denotes the MPT of the second PU, PTS₁ denotes the PTS of the second PU,PTS_(k) denotes the PTS of the first PU, timescale_(PTS) denotes theclock frequency of the PTS, timescale denotes the clock frequency of themedia presentation timeline, and MPT_(n) denotes an MPT for the firstPU.
 34. An apparatus for receiving a Moving Picture Experts Group 2Transport Stream (MPEG2 TS)-based media file, the apparatus comprising:a network interface to receive segments including the MPEG2 TS packetsof the media file and synchronization information including a MediaPresentation Time (MPT) of a first Presentation Unit (PU) of each of thesegments, wherein the MPT is in a media presentation timeline; and acontroller to perform a synchronization of at least one program elementforming a program included in the media file with other media includedin other media file by using the synchronization information, whereinthe synchronization information is a segment index box in InternationalOrganization for Standardization Based Media File Format (ISOBMFF) forthe MPEG2 TS packets, wherein the synchronization information furtherincludes timescale information representing a clock frequency of themedia presentation timeline and packet identifier (ID) information ofthe MPEG2 TS packets in the segment, and wherein an MPT of a second PUin the media file or the other media file is calculated by using aPresentation Time Stamp (PTS) difference between a PTS of the first PUand a PTS of the second PU, a clock frequency of the PTS of which valueis 90000 and the clock frequency of the media presentation timeline. 35.The apparatus according to claim 34, wherein the mapping of the PTSdifference is performed by using a following equation,MPT_(o)=((PTS₁−PTS_(k))/timescale_(PTS))*timescale+MPT_(n) where MPT₀denotes the MPT of the second PU, PTS₁ denotes the PTS of the second PU,PTS_(k) denotes the PTS of the first PU, timescale_(PTS) denotes theclock frequency of the PTS, timescale denotes the clock frequency of themedia presentation timeline, and MPT_(n) denotes an MPT for the firstPU.